CDS

Accession Number TCMCG073C21617
gbkey CDS
Protein Id XP_010546727.1
Location join(1660813..1660986,1661124..1661261,1661464..1661545,1661695..1661771,1661866..1661981,1662060..1662177,1662290..1662406,1662618..1662709,1662809..1662875,1663127..1663208,1663344..1663407,1663487..1663526)
Gene LOC104818720
GeneID 104818720
Organism Tarenaya hassleriana

Protein

Length 388aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA268022
db_source XM_010548425.2
Definition PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase isoform X3 [Tarenaya hassleriana]

EGGNOG-MAPPER Annotation

COG_category S
Description Protein of unknown function (DUF1624)
KEGG_TC -
KEGG_Module M00078        [VIEW IN KEGG]
KEGG_Reaction R07815        [VIEW IN KEGG]
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko00002        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K10532        [VIEW IN KEGG]
EC 2.3.1.78        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00531        [VIEW IN KEGG]
ko01100        [VIEW IN KEGG]
ko04142        [VIEW IN KEGG]
map00531        [VIEW IN KEGG]
map01100        [VIEW IN KEGG]
map04142        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGATTTCTTTTCCGTTGAAACACCCGAGAAAGAAAATGGCGGAAATCAAACCAGAGACCAGCAGCGACCATCAACGCCTCGTCGTCGTGTCGGAACCCGTCGACCCGAATCGGAAACTCACCGGAAGCAAACGACTTGCCTCTCTCGATGTCTTCCGTGGTCTCGCCGTCGCTTTGATGATTCTGGTGGATGACGCCGGCGGAGAATGGCCGGCGATTGCACACGCGCCCTGGCATGGCTGCAACCTGGCGGATTTCGTCATGCCTTTCTTCTTGTTCATCGTCGGCGTTTCCATTGCTCTTGCTCTCAAGAGAATTGGAAACAAATTCGAAGCTATAAAGAAGGTGGTTCTTAGGACATGCAAGCTCCTCTTCTGGGGTCTTCTACTTCAAGGGGGCTTCTCTCATGCTCCCGATAGATTAACGTACGGTGTTGATGTGAATATGATGAGGTTGTGCGGGATTCTCCAGAGAATAGCCTTATCGTACTTGATAGTCGCGTTGGTGGAGATTTTCACAATGGATTCACGGAGGGAGAATCTCTCGAATGGACTGTTCTCGATATTCAAGTCATATTATTGGCATTGGCTTGTGGGCGCAACTGTTCTTGTCATTTATCTGGCTACGCTTTACGGAACTTACGTACCGGACTGGCGATTCATTGTATACGATAGAGACAGCATTCTGTACGGGAAAACTCTCTCTGTATTGTGTGGTGTGAGGGGAAAGCTTGATCCTCCCTGTAATGCTGTTGGATATATCGACAGACAGCTTCTGGGGCTCAACCATATGTATCAGCATCCAGCATGGAAAAGATCCAAGGCTTGCACCTATGACTCCCCTTATGAGGGGCCTTTACGTAAAGATGCGCCTTCATGGTGCCATGCGCCATTTGAGCCAGAAGGAGTGTTAAGTTCCATATGTGCTGTTCTGTCTACAATCATCGGAGTTCATTTTGGACATGTTATCTTACACTTCAAGGGTCATTCAGCTCGGTTGAAGCACTGGATCTCCACTGGTATCGCTCTCCTCGTTCTCGGGCTCACCCTGCATTTCACCCACCTTTCAGCTACACTTGCGTCACTTCTGGAGCAGCGGCCCTCGTCTTCTCCTCATTTTATGCTCTGGTCGACATATGGGGCTGGAAGTACGTGTTCCTGCCATTGA
Protein:  
MISFPLKHPRKKMAEIKPETSSDHQRLVVVSEPVDPNRKLTGSKRLASLDVFRGLAVALMILVDDAGGEWPAIAHAPWHGCNLADFVMPFFLFIVGVSIALALKRIGNKFEAIKKVVLRTCKLLFWGLLLQGGFSHAPDRLTYGVDVNMMRLCGILQRIALSYLIVALVEIFTMDSRRENLSNGLFSIFKSYYWHWLVGATVLVIYLATLYGTYVPDWRFIVYDRDSILYGKTLSVLCGVRGKLDPPCNAVGYIDRQLLGLNHMYQHPAWKRSKACTYDSPYEGPLRKDAPSWCHAPFEPEGVLSSICAVLSTIIGVHFGHVILHFKGHSARLKHWISTGIALLVLGLTLHFTHLSATLASLLEQRPSSSPHFMLWSTYGAGSTCSCH